Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization
Abstract
Computing category-agnostic bounding box proposals is a core component of many computer vision tasks and has therefore attracted considerable attention lately. In this work we propose a new approach to this problem based on an active strategy for generating box proposals: it starts from a set of seed boxes uniformly distributed over the image and then progressively shifts its attention to the promising image areas where well-localized bounding box proposals are more likely to be found. We call our approach AttractioNet; a core component of it is a CNN-based, category-agnostic object location refinement module that yields accurate and robust bounding box predictions regardless of the object category. We extensively evaluate AttractioNet on the COCO 2014 validation set and on the PASCAL VOC2007 test set, reporting state-of-the-art results on both that surpass previous work in the field by a significant margin. Finally, we provide strong empirical evidence that our approach is capable of generalizing to unseen categories. Project page: https://github.com/gidariss/AttractioNet.
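The active strategy described in the abstract (seed boxes, repeated refinement, attention shifting to the refined locations) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the non-maximum suppression step, and all parameter values are assumptions chosen for illustration, and `toy_refine` is a hypothetical stand-in for the CNN-based refinement module that simply pulls each box halfway toward a fixed target.

```python
from typing import Callable, List, Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1)
Scored = Tuple[Box, float]               # (box, objectness-like score)


def seed_boxes(width: float, height: float, size: float, stride: float) -> List[Box]:
    """Square seed boxes laid out on a uniform grid that fits inside the image."""
    boxes = []
    y = 0.0
    while y + size <= height:
        x = 0.0
        while x + size <= width:
            boxes.append((x, y, x + size, y + size))
            x += stride
        y += stride
    return boxes


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def nms(cands: List[Scored], thresh: float) -> List[Scored]:
    """Greedy non-maximum suppression; result is sorted by descending score."""
    kept: List[Scored] = []
    for box, score in sorted(cands, key=lambda c: c[1], reverse=True):
        if all(iou(box, k[0]) <= thresh for k in kept):
            kept.append((box, score))
    return kept


def attend_refine_repeat(seeds: List[Box],
                         refine: Callable[[Box], Scored],
                         iterations: int = 3,
                         nms_thresh: float = 0.7,
                         top_k: int = 100) -> List[Scored]:
    """Refine the active boxes repeatedly, pooling every refined candidate."""
    active = list(seeds)
    candidates: List[Scored] = []
    for _ in range(iterations):
        refined = [refine(box) for box in active]           # refine
        candidates.extend(refined)
        active = [b for b, _ in nms(refined, nms_thresh)]   # attend, then repeat
    return nms(candidates, nms_thresh)[:top_k]


# Hypothetical stand-in for the learned refinement module (assumption):
# move each coordinate halfway toward a fixed target and score by IoU with it.
TARGET: Box = (40.0, 40.0, 80.0, 80.0)


def toy_refine(box: Box) -> Scored:
    new = tuple(b + 0.5 * (t - b) for b, t in zip(box, TARGET))
    return new, iou(new, TARGET)


seeds = seed_boxes(128, 128, size=32, stride=32)
proposals = attend_refine_repeat(seeds, toy_refine)
```

In this sketch every iteration restarts from the de-duplicated outputs of the previous one, so the candidate pool concentrates around the locations the refinement step favors. In a real system `refine` would be a learned model evaluated on image features.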
Similar resources
Boundary-aware box refinement for object proposal generation
Object proposals have been widely used in object detection to speed up object searching. However, many existing object proposal generators have poor localization quality, which weakens the performance of object detectors. In this paper, we present an effective approach to improve the localization quality of object proposals. We leverage the boundary-preserving property of superpixels and des...
DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images
In this paper, we develop a novel unified framework called DeepText for text region proposal generation and text detection in natural images via a fully convolutional neural network (CNN). First, we propose the inception region proposal network (InceptionRPN) and design a set of text characteristic prior bounding boxes to achieve high word recall with only hundred level candidate proposals. Nex...
The relationship among changes in microstructure, active sites behavior and properties in the propylene polymerization with a 4th generation Ziegler-Natta catalyst
Three polypropylene samples (1-3) were synthesized with a 4th generation Ziegler-Natta catalyst in the presence of cyclohexyldimethoxymethylsilane (donor c), dicyclopentyldimethoxysilane (donor d) and diisopropyldimethoxysilane (donor p), respectively, as external electron donors. The physical properties of the synthesized polypropylenes were determined. For samples 1 to 3, successive self-nuc...
Too Far to See? Not Really! - Pedestrian Detection with Scale-aware Localization Policy
A major bottleneck of pedestrian detection lies in the sharp performance deterioration in the presence of small-size pedestrians that are relatively far from the camera. Motivated by the observation that pedestrians of disparate spatial scales exhibit distinct visual appearances, we propose in this paper an active pedestrian detector that explicitly operates over multiple-layer neuronal represe...
CATS: Co-saliency Activated Tracklet Selection for Video Co-localization
1) Co-saliency Generation: Appropriate neighboring warped saliency maps are fused with the saliency maps of activators to generate different co-saliency maps. These maps are then similarly fused through averaging to generate the eventual co-saliency object prior (O). 2) Bounding-Box Filtering: The co-saliency object prior helps filter out noisy bounding box proposals and keep good ones. 3...
Journal: CoRR
Volume: abs/1606.04446
Publication year: 2016